Impact of I/O and Data Management in Ensemble Large Scale Climate Forecasting Using EC-Earth3

نویسندگان

  • Muhammad Asif
  • Andrés Cencerrado
  • Oriol Mula-Valls
  • Domingo Manubens
  • Francisco Doblas-Reyes
  • Ana Cortés
چکیده

The EC-Earth climate model is a seamless Earth System Model (ESM) used to carry out climate research in 24 academic institutions and meteorological services from 11 countries in Europe. This model couples several components and it is continuously under development. In this work we present a study regarding the impact of the I/O and data management when using EC-Earth in well-known supercomputing environments. Most large-scale and long-term climate simulators have been developed bearing in mind the paramount importance of its scalability. However, the computational capabilities of the High Performance Computing (HPC) environments increase at so great speed that it is almost impossible to re-implement the whole models so that they are able to exploit efficiently the new features. Therefore, it is necessary to design different strategies to take advantage of them. In this work we present an operational framework to run ensemble simulations in HPC platforms. A set of experiments are presented in order to validate the suitability of this technique. Moreover, the derived impact regarding the I/O and data management aspects is analyzed. EC-Earth, the European Community Earth System Model EC-Earth is a seamless Earth System Model (ESM) selected by IC3 to perform climate research as end users and also as part of the EC-Earth consortium. In this study EC-Earth v3.0.1 was used and here is below a brief description of EC-Earth components: • IFS is ECMWF’s Integrated Forecast System (atmospheric component for ESM). • NEMO the Nucleus for European Modelling of the Ocean (oceanic component for ESM) not only includes the ocean circulation but also sea-ice and biogeochemistry. • OASIS3 a coupling software used to couple IFS and NEMO with resolutions ranging from approximately 128 to 25 kilometers. Figure 1: Example of Sea Surface Temperature forecast for one month (mean values, Kelvin scale) using the standard resolution (1 degree) of the ocean component NEMO. Here are below the detailed list of parameters in conncection to different configurations used for this study where T159, T255 and T799 are IFS configurations and refer to approximately 125km, 80km and 25km grid-spacing respectively. On the other hand; ORCA1 and ORCA025 are NEMO configurations and refer to 110km and 25km: Parameter T159-ORCA1 T255-ORCA1 T799-ORCA025 IFS time step 1 hour 1 hour 12 minutes IFS output frequency 6 hours 6 hours 6 hours NEMO time step 1 hour 1 hour 20 minutes NEMO output frequency 1d, 5d, 1m, 1y 1d, 5d, 1m, 1y 1d, 5d, 1m, 1y OASIS3 coupling frequency 3 hours 3 hours 3 hours Output size (1-month run) 0.9 GB approx. 1.6 GB approx. 21.4 GB approx. Restart size 1.4 GB approx. 2.3 GB approx. 25 GB approx. Table 1: Parameters of different EC-Earth3 configurations Large Scale Climate Simulations in HPC Environments Each configuration detailed in last section has been used in different HPC platforms. Subsequently, their main features are presented, where a) corresponds to the platform used with T159-ORCA1, b) the one used with T255-ORCA1, and c) the one used with T799-ORCA025: a) MareNostrum2 (Barcelona Supercomputing Center, Spain): IBM BladeCenter JS21 with IBM Power PC 970MP processors (2.3GHz) and Myrinet interconnect technology. b) Lindgren (PDC Center for High-Performance Computing, Sweden): Cray XE6 system, based on AMD 12-core Magny-Cours Opteron (2.1 GHz) processors and Cray Gemini interconnect technology. c) Jaguar (Oak Ridge National Laboratory, USA): Cray XT5 system, based on AMD 6-core Istanbul Opteron (2.6 GHz) processors and InfiniBand network. Figure 2: EC-Earth simulation features for each configuration at Lindgren, PDC. A well-known goal among the climate research community is to be able to simulate 10 years per wall clock day, which means to simulate approximately 1 hour per second. In Figure 2 the detailed scalability analysis made with respect to Lindgren shows that the current community ambitions are not being met clearly and obviously there is high need to exploit HPC environment further. Operational Framework to Run Ensemble Simulations Due to scalability limit of the ESM; as that could also be seen in Figure 3 it seems testing and validating the option of running ensemble simulations could be helpful in explotiting the utilization of HPC power in an efficient way. Figure 3: Left: Elapsed times of the T799-ORCA025 configuration as a function of the total number of processors for 5-day simulations, running in Jaguar. Right: Scaling of the T799-ORCA025 configuration as a function of the total number of processors for 5-day simulations. Blue, orange and yellow curves correspond to results for tests with NEMO using 384, 576 and 768 CPUs. Speedup values are relative to the reference test (778 processors). Figure 4: Sample experiment setup. Figure 5: Average simulation times as a function of the number of jobs wrapped using configurations T159-ORCA1 (run in MareNostrum2) and T255-ORCA1 (run in Lindgren). Green portions show communication times. Figure 6: Top: average simulation times as a function of the number of jobs wrapped using configurations T799ORCA025 (run in Jaguar). Green portions show communication times. Bottom: percentage of execution times, including filesystem access time Conclusions • EC-Earth3 presents a good scalability upto a few thousand processors only. • A wrapping technique is exploited and validated where many independent simulations are run as a big single job. • I/O does not exihibit the bottleneck with wrapping technique but in future it may pose issues as if the model’s scalability is improved or the frequency of I/O time steps increases. • While management of the data produced with this technique seems to be challenging. Acknowledgements This study was supported by the European Unions FP7-funded SPECS (308378) and IS-ENES2 (312979) projects, the MICINN-Spain under contract TIN2011-28689-C02-01 and the Catalan Government. It used resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory through the grant CLI055, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Forecasting the effects of climate change on the climatic of the Miahne-city using climate models (SDSM)

The purpose of this study is to determine and predict the effects of climate change on architecture using climate models. By clarifying the consequences of the impact of urban architecture on climate change while using the data of surrounding stations with library and field data methods. The climatic parameters of precipitation, temperature and relative humidity of the synoptic station of Miyan...

متن کامل

Forecasting of rainfall using different input selection methods on climate signals for neural network inputs

Long-term prediction of precipitation in planning and managing water resources, especially in arid and semi-arid countries such as Iran, has a great importance. In this paper, a method for predicting long-term precipitation using weather signals and artificial neural networks is presented. For this purpose, climatic data (large-scale signals) and meteorological data (local precipitation and tem...

متن کامل

Seasonality in Tourism and Forecasting Foreign Tourist Arrivals in India

In the present age of globalization, technology-revolution and sustainable development, the presence of seasonality in tourist arrivals is considered as a key policy issue that affects the global tourism industry by creating instability in the demand and revenues. The seasonal component in a time-series distorts the prediction attempts for policy-making. In this context, it is quintessential to...

متن کامل

Future climate change impact on hydrological regime of river basin using SWAT model

Hydrological components in a river basin can get adversely affected by climate change in coming future. Manipur River basin lies in the extreme northeast region of India nestled in the lesser Himalayan ranges and it is under severe pressure from anthropogenic and natural factors. Basin is un-gauged as it lies in remote location and suffering from large data scarcity. This paper explores the imp...

متن کامل

Simulation of the climate change impact on monthly runoff of Dez watershed using IHACRES model

Identification and analysis of flow fluctuations in consequences of climate change is one of the most important factors in water resources management planning and this is vital especially in areas where large crowds are engaged in agriculture. Dez watershed, as an agricultural hub in the country, is one of areas that river flow fluctuations caused by climate change can affect a large population...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014